Meaning Representation for Knowledge Sharing in Practical Machine Translation

نویسنده

  • Kavi Mahesh
چکیده

Knowledge-based machine translation can be viewed as the problem of extracting and representing the meaning of a text and generating a translation in a target language using the meaning representation. Meaning extraction requires the integration of information present explicitly in a text with common sense and domain knowledge given to the system. Thus, integrating linguistic knowledge of each language with general world knowledge is a central problem in machine translation , especially when more than two languages are involved. In this article we consider the design of a meaning representation that enables language-speciic lexicons to share knowledge with a language-independent world model. We illustrate how the underlying core meaning representation can be enhanced in three diierent ways to arrive at lexical, ontological, and text meaning representations. The meaning representations presented here have been implemented in the Mikrokos-mos machine translation system and used to represent Span-ish and Japanese lexicons in addition to a broad-coverage ontological world model. Most human knowledge is represented and communicated via natural languages in spite of relatively recent eeorts such as Cyc (Lenat and Guha, 1990) to \computerize" common sense knowledge. Extracting the meaning of a natural language text involves integrating the information present overtly in the text (which is typically incomplete) with the linguistic and world knowledge sources given to the system. The design of such a knowledge based natural language processing (NLP) system is inseparable from issues of sharing knowledge between linguistic and world knowledge sources, and, in turn, sharing knowledge representations between lexical, textual, and ontological meaning representations. Machine translation (MT), especially when it involves more than two languages, is an NLP problem that provides a particularly interesting context to study knowledge shar-0 ing between multiple knowledge bases containing diierent types of knowledge. In multilingual knowledge based machine translation (KBMT), the meaning of a source text is extracted using both linguistic knowledge of the source language and general world knowledge. The resulting meaning representation is then used to generate translations in one or more target languages. Language generation also requires both world knowledge and knowledge of the target language. The internal representation of the meaning of a text must be independent of particular languages to enable translation to or from any of a set of languages. A fundamental issue in the design of multilingual MT (and other NLP) systems is how to represent (i) linguistic knowledge for each language, (ii) general world knowledge, and …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ontology Development for Machine Translation: Ideology and Methodology

In the Mikrokosmos approach to knowledge-based machine translation, lexical representation of word meanings as well as text meaning representation is grounded in a broad-coverage ontology of the world. We have developed such a language-neutral ontology for the purpose of machine translation in the situation of the Mikrokosmos project. In order to acquire a fairly large ontology with limited tim...

متن کامل

New developments in knowledge-based machine translation

Recent experience in Knowledge-Based Machine Translation (KBMT) suggests that systems adhering to this paradigm can be useful not only in very small domains and limited sublanguages. With special attention paid to acquisition of large knowledge bases and with the advent of new tools (including representation languages, human-computer interfaces, database management systems, etc.) the practicali...

متن کامل

A Lexicon for Knowledge-Based MT

In knowledge-based machine translation (KBMT), the lexicon can be specified and acquired only in close connection with the specification and acquisition of the world model (ontology) and the specification of the text meaning representation (interlingua) language. The former supplies the atoms for the specification of text meaning and provides world knowledge to support the inference processes n...

متن کامل

Text Meaning Representation For Chinese

This paper describes text meaning representation for Chinese. Text meaning representation is composed of a set of ontological concept instances along with ontological links among them. It integrates lexical, textual and world knowledge into a single hierarchical framework. In NLP application it serves as an interlingual representation for various processing. The methodology and implementation o...

متن کامل

Lexicon, Ontology, and Text Meaning

A computationally relevant theory of lexical semantics must take into consideration both the form and the content of three different static knowledge sources -the lexicon, the ontological domain model and a text meaning representation language. Meanings of lexical units axe interpreted in terms of their mappings into the ontology and/or their contributions to the text meaning representation. We...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996